Search for single top quark production in pp collisions at ^ l fs = 1.96 TeV in the 
missing transverse energy plus jets topology 
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We report a search for single top quark production with the CDF II detector using 2.1 fb _1 
of integrated luminosity of pp collisions at ^/s = 1.96 TeV. The data selected consist of events 
characterized by large energy imbalance in the transverse plane and hadronic jets, and no identified 
electrons and muons, so the sample is enriched in W — > rv decays. In order to suppress backgrounds, 
additional kinematic and topological requirements are imposed through a neural network, and at 
least one of the jets must be identified as a 6 quark jet. We measure an excess of signal-like events 
in agreement with the standard model prediction, but inconsistent with a model without single 
top quark production by 2.1 standard deviations (cr), with a median expected sensitivity of 1.4 a. 
Assuming a top quark mass of 175 GeV/c 2 and ascribing the excess to single top quark production, 
the cross section is measured to be 4. 9^2 2 (stat+syst) pb, consistent with measurements performed 
in independent datasets and with the standard model prediction. 
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PACS numbers: 14.65. Ha, 13.85.Ni, 12.15.Hh, 



I. INTRODUCTION 

At the Tevatron the dominant standard model (SM) 
mechanism for top quark production in pp collisions 
is the production through strong interactions of ti 
pairs with a cross section of approximately 7.0 pb 
Top quarks can also be produced singly through elec- 
troweak processes, which are interesting in their own 
right. The single top quark production cross section 
is directly proportional to the square of the magni- 
tude of the | V t b | element of the Cabibbo-Kobayashi- 
Maskawa (CKM) matrix @, 0] . A measurement of the 
single top quark production cross section thus con- 
strains the value of the latter. A value of \V t b\ smaller 
than unity could thus indicate the presence of a fourth 
family of quarks Q , while on the other hand an ap- 
parent | Vtb | value significantly greater than one could 
point, for instance, to the existence of a heavy VF-like 
boson enhancing the cross section. A review of new 
physics models affecting the single top quark produc- 
tion cross section is given in Ref. [5j . 

At the Tevatron, a single top quark can be produced 
at leading order (LO) together with a b quark in the s- 
channel, or paired with a light quark in the i-channcl 
(the charge conjugated process is assumed through- 
out). The i-channel process also has a large next-to- 
leading-order (NLO) contribution which gives an ad- 
ditional b quark in the final state. The Feynman dia- 
grams for the above processes are shown in Fig. [I] The 
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FIG. 1: Feynman diagrams for single top quark produc- 
tion. Represented are the LO (a) and NLO (b) f-channel 
processes, and the LO s-channel process (c). 



SM NLO calculations predict the single top quark pro- 
duction cross section to be a s = 0.88±0.11pb for the 
s-channel and a t — 1.98 ± 0.25 pb for the t-channel [H- 
[H, for an assumed top quark mass of 175 GeV/c 2 . 

The top quark has a predicted lifetime of roughly 
10 _25 s, and the SM predicts it decays into a W bo- 
son and a b quark almost 100% of the time, assuming 
I Vtb | 2 > \V ts \ 2 + \V t d\ 2 ■ The W boson subsequently 
decays to either a quark-antiquark pair or a lepton 
pair. Events with decays W — > ev and W — > \iv are 
the favored identification modes at a hadron collider 
due to the presence of the charged lepton and large 
missing transverse energy from the neutrino. The 
identification of both objects suppresses the otherwise 
large QCD background. In single top quark searches, 
VF — y tv decays have been studied until now only 
in events in which the tau decays to e or /i. Elec- 
troweak production of top quarks is difficult to isolate 
due to the low cross section and large backgrounds 
which can be estimated only with large uncertainties 
on their rates. 

The first evidence of electroweak top quark produc- 
tion has been achieved by the DO collaboration with 
the charged lepton plus missing energy plus jets sig- 
nature using 0.9 fb -1 of integrated luminosity [{j. flOf . 
A more recent measurement in the same decay mode 
with an observed significance of 3.7a has been released 
by the CDF collaboration using 2.2 fb -1 of integrated 
luminosity [ll| . In March 2009, both collaborations 
achieved the observation (5.0 a significance) level, us- 
ing 2.3fb~ 1 and up to 3.2 fb -1 of integrated luminos- 
ity for the DO collaboration [l2[ and the CDF collab- 
oration respectively. To add acceptance to the 
dataset with identified e or fj,, the latter measure- 
ment uses for the first time events containing jets, 
large missing transverse energy, and no reconstructed 
electrons or muons. The analysis of these events is 
described in this paper. 

This signature comprises events with W — > tv de- 
cays where the hadronic t decays are dominant, and 
with W — > ev or W — > \w decays where the e, /i are 
unidentified. Because the event selection vetoes on the 
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presence of a reconstructed e or /i this measurement 
is statistically independent from the one in Ref. [TTj . 
The single top quark candidate events analyzed share 
the same signature as events where the SM Higgs bo- 
son is produced in association with a W or Z boson, 
where the W decays leptonically in either hadronic rs 
or unidentified e or /i, or the Z decays to neutrinos. 
The techniques used in this single top quark search are 
shared with the ones deployed in the SM Higgs boson 
search in the same signature [l4[ • Since the single top 
quark events are a background to the SM Higgs bo- 
son search, measuring the single top quark production 
cross section in this sample means getting a step closer 
to reaching sensitivity to the SM Higgs boson signal. 

With respect to the single top quark search in the 
sample with identified leptons jll] this analysis has 
the challenge of much larger backgrounds masking 
the presence of the signal. In fact, the sample un- 
der study is dominated by QCD multijet production, 
where a mismeasurement of one or more of the jet en- 
ergies yields large energy imbalance in the detector. 
The QCD background dominates the signal by 4 or- 
ders of magnitude after the application of the online 
trigger selection. Also, the absence of reconstructed 
charged leptons and the presence of the neutrino re- 
sult in underconstrained kinematics, leaving us with 
the impossibility to reconstruct the top quark invari- 
ant mass from its decay products. To improve the 
signal-to-background ratio (s/b), we select jets identi- 
fied as originating from b quarks using 6-tagging algo- 
rithms. Even after these requirements, the s/b is still 
too low to achieve sensitivity to single top quark pro- 
duction. We further exploit the kinematic and topo- 
logical characteristics of single top quark events using 
neural networks to isolate the signal from the domi- 
nant QCD background and subsequently from the re- 
maining backgrounds. 

We report results based on data taken with the CDF 
II detector between July 2002 and August 2007, cor- 
responding to an integrated luminosity of 2.1 fb _1 . 
The paper is organized as follows. SectionlHl contains 
a brief description of the CDF II detector. The ana- 
lyzed dataset is described in Sec. lIIIl Section|IV] con- 
tains the definition of the reconstructed objects used 
in this analysis. In Sec. [V] we discuss the Monte Carlo 
simulation and data-based techniques we use to model 
the signal and the backgrounds. Section lVTl describes 
the preselection and the neural network event selec- 
tion used to suppress the dominant QCD background. 
Section lVTll describes the distribution used to scan for 
a signal, while Sec. lVIIll lists the sources of systematic 
uncertainties that affect the final result. Section llXl 
presents the likelihood procedure used to measure the 
cross section and the |Vt&| matrix element. The single 
top quark production cross section measured in miss- 
ing transverse energy plus jets events is presented in 
Sec.|X] Finally, results are summarized in Sec.lXIl 



II. THE CDF II DETECTOR 

The CDF II detector [l5| is an azimuthally and 
forward-backward symmetric apparatus designed to 
study pp collisions at the Fermilab Tevatron. It uses 
a cylindrical coordinate system as described in 16]. 
It consists of a magnetic spectrometer surrounded by 
calorimeters and muon detectors. The charged par- 
ticle tracking system is contained in a 1.4 T solenoid 
in which the magnetic field is parallel to the beam. 
A set of silicon microstrip detectors provides charged 
particle tracking in the radial distance from 1.5 to 
28 cm [l7l - [l9j . A 3.1 m long open-cell drift chamber, 
the central outer tracker (COT) [2(j, covers the ra- 
dial distance from 40 to 137 cm. The COT provides 
up to 96 measurements of the track position with al- 
ternating axial and ±2°-stereo superlayers of 12-wire 
layers each. The fiducial region of the silicon detec- 
tor extends in pseudorapidity |ry| up to |ry| < 2, while 
the COT provides full radial coverage up to |ry| < 1. 
Segmented electromagnetic and hadronic calorimeters 
surround the tracking system and measure the energy 
of interacting particles. 

The electromagnetic and hadronic calorimeters arc 
lead-scintillator and iron-scintillator sampling devices 
respectively, covering the range \r)\ < 3.6. They are 
segmented in the central region (|ry| < 1.1) in tow- 
ers of 0.1 in rj and 15° in azimuthal angle 0, and 
in the forward region (1.1 < \rj\ < 3.6) in towers 
of 0.1 to 0.64 units of r\ (corresponding to a nearly 
constant 2.7° change in polar angle) and 7.5° in az- 
imuthal angle for 1.1 < \rj\ < 2.11 and 15° for 
| r\ | > 2.11. The electromagnetic calorimeters [2lT l22l| 
are instrumented with proportional and scintillating 
strip detectors that measure the transverse profile of 
electromagnetic showers at a depth corresponding to 
the expected shower maximum. The measured en- 
ergy resolution for electrons in the electromagnetic 
calorimeters are 13.5%/v^r © 2% in the central re- 
gion, and 16%/V-Et © 1% in the forward region (23j 
where the units of Et are in GeV. We also measure the 
single-particle (pion) energy resolution in the hadronic 
calorimeters to be 50% /-JEt © 3% for the central and 
80%/ v / ^©5%, 
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for the forward detector 
Drift chambers located outside the central hadronic 
calorimeters and behind a 60 cm thick iron shield de- 
tect muons with \r)\ < 0.6 [25j]. Additional drift cham- 
bers and scintillation counters detect muons in the 
region 0.6 < \rj\ < 1.5 26]. Gas Cherenkov counters 
with a coverage of 3.7 < \r/\ < 4.7 measure the aver- 
age number of inelastic pp collisions and thereby de- 
termine the luminosity [271 ] . 



III. £ T PLUS JETS DATASET 

The events of interest are those with a top quark 
produced in association with one or more jets, where 
the top quark decays to a & quark and a W boson, and 
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the W boson decays leptonically. Since we are looking 
at events with non-identified electrons and muons, or 
hadronically decaying taus, we use a trigger devised 
to select events on the presence of two calorimetric 
jets and large missing transverse energy. The miss- 
ing transverse energy $t is calculated as the negative 
vector sum of the energy in each calorimeter tower 
multiplied by a unit vector in the azimuthal direction 
of the tower. The $t symbol is used for the magni- 
tude of $t- 

CDF uses a three-level trigger system, the first two 
consisting of special purpose electronics and the third 
level consisting of conventional computer processors. 
For triggering purposes the calorimeter granularity is 
simplified to a 24 x 24 grid in 77 — <fi space and each 
trigger tower spans approximately 15° in <j> and 0.2 
in 77 covering one or two physical towers. At level 1, 
$t > 25 GeV is required, while at level 2 we require 
the presence of two calorimetric clusters, each with 
transverse energy greater than 10 GeV. Finally, at 
level 3 $t > 35 GeV is required. With increasing in- 
stantaneous luminosity delivered by the Tevatron col- 
lider, tighter constraints at trigger level were needed. 
Starting from March 2005, the level 2 trigger definition 
changed by demanding that one of the two calorime- 
ter energy clusters be central, i.e. \r)\ < 1.1. The 
steadily increasing Tevatron performance required an 
additional change in November 2006, where the trig- 
ger path was turned off as soon as the initial lu- 
minosity exceeded 1.90 x 10 32 cm -2 s _1 . Starting 
from April 2007, for instantaneous luminosity above 
2 x 10 32 cm' 2 s _1 events were randomly discarded 
based on a scaling factor between 1 and 40 to keep 
the trigger rate at a reasonable level. 

Overall, 14 963 805 events pass the online trigger 
requirements, corresponding to an average effective 
cross section for the data collected by the trigger 
around 10 nb. The single top quark production cross 
section times W — > Iv branching ratio is about 1 pb, 
further reduced by the fact that this analysis is de- 
vised to collect mostly W —> tv decays and only a 
fraction of the W — > e//i v decays. The s/b ratio for 
events surviving the trigger level selection is thus of 
the order of 1/10000. 

The i£r plus jets trigger efficiency is computed us- 
ing data collected with a high pt muon, and with 
data collected with a trigger requiring the presence of 
a jet with Et > 20 GeV. Trigger efficiencies are calcu- 
lated for all three levels of the CDF trigger, and are 
then parametrized as a function of $t and Et of the 
jets. The systematic uncertainties originating from 
the choice of the samples used in the efficiency calcu- 
lations are large at small j£r, and therefore we require 
every event to have &t >50 GeV. The trigger is nearly 
100% efficient if the jets with the highest and sec- 
ond highest transverse energies satisfy the conditions 
E% >35GeV and >25GeV respectively. Addi- 
tionally, we require the spatial separation between the 
two leading Et jets to be greater than AR = 1, where 



AR — \J At] 2 + Acj) 2 is the distance in the 77 — <j> space 
between the two jet centroids, in order to avoid jet 
merging performed by the level 2 jet clustering algo- 
rithm. A small fraction of events do not pass the 
level 1 requirements due to a hardware problem, and 
are recovered with a trigger on inclusive jets [28j . 



IV. EVENT RECONSTRUCTION 

Events are considered whenever the primary event 
vertex is reconstructed inside the luminous region 
(|z| < 60 cm) along the beam axis. Jets are iden- 
tified using a fixed-cone algorithm which loops over 
calorimetric towers, with a cone radius of 0.4 in 77 — cj> 
space. The jet energies are corrected for variations in 
calorimeter response and the presence of multiple pp 
collisions. First, we take into account calorimeter re- 
sponse variations in 77 and over time, and energy loss 
in the uninstrumented regions. After a small correc- 
tion for the extra energy deposited inside the jet cone 
by multiple collisions in the same accelerator bunch 
crossing, a correction for calorimeter non-linearity is 
applied so that the jet energies correspond to the most 
probable in-cone hadronic energy. Each of these steps 
has an individual systematic uncertainty that is added 
in quadrature to derive the total uncertainty which 
decreases from 8% for jet transverse energies around 
15 GeV down to 3% for jet energies above 60 GeV. 

After these corrections the jet energy provides a 
good estimate of the initial parton energy. This is ver- 
ified by transverse momentum balance in events with 
a single jet recoiling against a well measured probe ob- 
ject such as a prompt photon oraZ^ £ + £~ 29]. Jet 
energies are further corrected using the algorithm de- 
veloped by the HI collaboration [30] , which combines 
the measurement of the momentum of charged parti- 
cles in the spectrometer with the calorimeter energy 
measurement. 

In order to improve the s/b ratio, we exploit the 
heavy flavor content of single top quark events using a 
fe-tagging algorithm based on secondary vertex recon- 
struction (secvtx) as described in detail in Ref. f3lj ]. 
The algorithm aims at the identification of jets con- 
taining a 6-hadron by reconstructing its decay ver- 
tex with at least two good quality tracks with hits in 
the silicon vertex detector. A 6-tagged jet must have 
a secondary vertex displaced from the primary ver- 
tex by more than 7.5 ctvtx in the transverse plane, 
where <tvtx = 190 /im is the typical secondary vertex 
spacial resolution in the transverse plane. A second 
algorithm, jetprob, is also used to identify jets orig- 
inating from b quarks. This algorithm computes the 
probability that all tracks associated with a jet come 
from the primary pp interaction vertex. The proba- 
bility calculation is based on the impact parameters 
of the tracks in the jet, and their uncertainties [32j . 
We consider a jet to be JETPROB-tagged when the 
probability of all jet tracks to come from the primary 



7 



vertex is less than 0.05. Electrons are reconstructed as 
charged particles in the tracking system that leave the 
majority of their energy in the electromagnetic section 
of the calorimeter. Muons are identified as charged 
particles in the tracker that leave hits in the muon 
chambers located outside the calorimeter. If isolated 
high momentum muons are found in the event, $r 
is corrected by subtracting the average muon ioniza- 
tion energy released in the calorimeter and adding the 
muon px to the vector sum. No specific tau identifica- 
tion algorithm is used in this analysis and in Ref . . 
Events with r leptons decaying leptonically are some- 
times collected in the dataset analyzed in Ref. 
by identifying their e or/i decay products, while the 
event selection described here collects hadronic r de- 
cays whenever the decay products are reconstructed 
as jets. We veto events with reconstructed electrons 
and muons in order to keep this sample statistically 
independent from the one analyzed in Ref. [TTj . 

The critical part of this analysis is the requirement 
of the $t signature. The $t in the event can stem 
not only from neutrinos, but also from various instru- 
mental and detector effects. Events containing large 
tiT could have originated from non-collision sources, 
such as cosmic or beam-halo muons passing through 
the detector or noisy or dead calorimeter cells caus- 
ing an energy imbalance. These types of events are 
removed by requiring that the event observables indi- 
cate an inelastic collision with large momentum trans- 
fer, such as the presence of at least one high quality 
primary vertex in the collision and at least one central 
jet with Et > lOGeV. Additional requirements are 
also imposed to remove events consistent with beam- 
halo muons traversing the detector or those caused 
by noisy calorimeter cells. After these requirements, 
the leading source of $t is jet energy mismeasure- 
ment due to either jets pointing to non-instrumented 
regions of the calorimeter, or to calorimeter resolu- 
tion effects. Both categories of events are character- 
ized by often being aligned with the projection 
of one of the jet three-momenta (J) in the azimuthal 
plane. Other characteristic properties of this instru- 
mental background will be described in Sec. lVIi to- 
gether with the strategy devised to suppress it. 



V. SIGNAL AND BACKGROUND 
MODELING 

A. Signal modeling 

The single top quark production is simulated as- 
suming a top quark mass of 175GeV/c 2 using the 
MADEVENT [33j matrix element generator, interfaced 
to the CTEQ5l [34| parametrization of the parton dis- 
tribution functions (PDF) . The madevent generator 
models the polarization of the top quark and the dis- 
tributions of the final state decay products accord- 
ingly. The transition from final state colored particles 



to colorless objects is done through the PYTHIA [35[ 
parton showering and hadronization routines. 

It has been shown that the inclusion of the next- 
to-leading-order diagrams results in an increase in 
the cross section for s-channel production mode, but 
does not change significantly its kinematics @ . The s- 
channel events are thus generated at leading order and 
the cross section is scaled to the next-to-leading-order 
rate @ . 

For the i-channel, the leading-order process for sin- 
gle top quark production is a 2 — > 2 process with 
a b quark in the initial state: b + u — > d + t or 
b + d u + t. Single anti-top quark production 
implies the conjugate processes. As several authors 
have pointed out [6, 36], the distribution of observable 
jets is not adequately represented by the LO contri- 
bution to the t-channel production of single top quark 
and it is better predicted by next-to-leading-order cal- 
culations. In the latter, the b quark stems from a 
gluon splitting into a bb pair. The b quark required 
by the flavor conservation of the strong interaction is 
created by LO parton shower programs through back- 
ward evolution following the DGLAP scheme |37| ■ The 
high-pT tail of the transverse momentum distribution 
of the b quark is not well modeled by this scheme. 
The mismodeling is estimated by comparing with a 
NLO calculation Q . The modeling of the t-channel 
single top quark process can be improved by produc- 
ing simulated events with a matrix element generator, 
followed by the simulation of the production of observ- 
able particles by pythia. For this, two samples are 
used: one for the leading 2 — > 2 process, and one for 
the 2 — > 3 process with a gluon in the initial state 
g + q—^q + t + b. In the latter process, the b quark is 
directly produced in the hard scattering described by 
the matrix element. It also describes the important 
high-pr tail of the b quark pt distribution. The con- 
struction of a Monte Carlo simulated sample following 
the NLO predictions is done by matching the 2^2 
and 2 — > 3 processes as described in 38] . 



B. Background modeling 

There are numerous standard model processes be- 
sides single top quark production that can produce 
the signature characterized by large i£r, relatively 
low jet multiplicity, and no reconstructed charged lep- 
tons. The most significant background at the first 
stage of the analysis is the QCD multijet production. 
Although these processes generally do not produce 
neutrinos, mismeasured jet energies do result in im- 
balance in the measured transverse energy by which 
the QCD events can pass the basic selection. Fur- 
thermore, QCD b quark pair production yields neutri- 
nos whenever one 6-hadron decays semi-leptonically, 
thus giving additional $t- The background sources 
for this final state are due mainly to QCD produc- 
tion of heavy-quark pairs (bb and cc) and jets falsely 
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tagged as 6-jets. 

Because of the high production rate for QCD at 
a hadron collider and the large statistics needed in 
order to describe this process adequately in an anal- 
ysis looking for a very small signal, the Monte Carlo 
simulation of an acceptable amount of QCD events 
is prohibitive. Moreover, the systematic uncertainties 
associated with the Monte Carlo simulation of QCD 
jet production are high. For these reasons, we esti- 
mate the QCD background solely from data. 

Events collected by the i^T+jets trigger are ex- 
pected to be composed mostly of QCD production of 
light flavor jets. We model the heavy flavor jets QCD 
rate and distributions by weighting events without any 
6-tagging requirement by the probability to tag a jet 
as a 6-jet. This probability is extracted from events 
depleted in single top quark signal, i.e. events with 

50 < t T < 70 GeV and A0(# T ,j2) < 0.4, and two 
or three jets [28[ The tag rate per jet is evaluated as 
a ratio of b— tagged to fiducial jets, where the fiducial 
jets are the ones in the kinematic region where the sec- 
ondary vertex detection efficiency is nonzero. The tag 
rate is parametrized in terms of variables sensitive to 
both the efficiency of the identification of true heavy- 
flavored objects and the rate of false tags. These vari- 
ables are the jet Et, the absolute value of the jet rj, 
the scalar sum of the transverse energies of the jets 
in the event {Ht), and the fraction of jet px carried 
by the charged particles inside the jet which are sig- 
nificantly displaced from the collision point. To com- 
pute the last quantity, all charged particles satisfying 
0.5 < Pt < 200 GeV/c are used, and they are required 
to have the distance of closest approach to the beam- 
line (do) significantly displaced from the beamline, i.e. 
Mo/c^ol > 2.5 where <7d is the uncertainty on do. 

The tag rate parametrization is then used to es- 
timate the probability that a fiducial jet in the sig- 
nal candidate sample is tagged. We construct three 
independent parametrizations to estimate the back- 
ground in events with exactly one SECVTX-tagged jet 
(IS), two SECVTX-tagged jets (2S category), and one 
SECVTX-tagged and one JETPROB-tagged jet (SJ cat- 
egory). Events which belong to both 2S and SJ cate- 
gories are assigned to the 2S subsample. Events with 
three 6-tagged jets are discarded. In this way the three 
selections are orthogonal by construction so that an 
event can belong to only one category. 

By summing the probability of 6-tagging each fidu- 
cial jet in each 6-tag subsample and weighting the rate 
and distributions of data events before any b— tagging 
requirement (pretag sample), we predict the rate of 
QCD 6-tagged jet multijct production background 
events and its kinematic distributions. We predict 
the kinematic properties of events with one SECVTX- 
tagged jet QCD background from the pretag sam- 
ple, and the kinematic properties of QCD background 
events in the 2S and SJ categories from events with 
one SECVTX-tagged jet. The parametrizations do not 
completely account for the fact that events with pair 



production of heavy flavor have enhanced probabil- 
ity to be tagged. For this reason, the normaliza- 
tion of the background events arising from the sim- 
ple application of the parametrization needs to be 
scaled. This normalization procedure is described in 
the next section. The performance of the parametriza- 
tions and of the resulting estimate is shown in Fig. [5] 
where we compare our background model to data 
in a QCD dominated region containing events with 

t T > 70 GeV and A(f}(# T ,f 2 ) < 0.4. We see that the 
tag rate parametrization produces a good modeling of 
the kinematic properties of the QCD events. When 
building the QCD model in samples containing size- 
able contributions of non-QCD process such as W + 
jets production, we apply the tag rate parametrization 
to our Monte Carlo simulation of non-QCD processes, 
and subtract the output from the QCD background 
estimate. The normalization of the QCD background 
contribution is set as a scale factor derived in a control 
region, which multiplies the prediction obtained with 
the tag rate parametrization, as described in the next 
section. 

The other backgrounds to single top quark produc- 
tion in this signature come from the production of a W 
or a Z boson in association with jets, top quark pair 
production through strong interactions, and pair pro- 
duction of heavy vector bosons. All these background 
processes are simulated using the PYTHIA Monte Carlo 
simulation program. 

We normalized the W and Z boson + heavy fla- 
vor jets backgrounds using the inclusive cross sections 
measured by CDF [391 ] . The measurements correspond 
to a factor of 1.4 with respect to the pythia LO pre- 
dictions. In this way, Z /W boson samples have the 
correct normalization with respect to their inclusive 
production. However, the heavy flavor production 
simulated by pythia provides a possible source of sys- 
tematic error. We assign a 40% uncertainty based on 
the total uncertainty of the Z + heavy flavor jets cross 
section measurement in CDF (4(| . The production of 
W and Z bosons in association with light flavor jets 
is included in the multijet modeling. 

Top quark pair production yields a non-negligible 
contribution to the background in the signal region. 
Due to the large top quark mass and the leptonic de- 
cay of the W boson originating from the top quark, 
these events are energetic, have large $t, and have 
high jet multiplicity. As for the signal events, in this 
analysis simulated it events were generated assuming 
a top quark mass of 175GeV/c 2 . The cross section 
corresponding to that mass has been computed to be 
6.7 ±0.8 pbp|. 

For the simulated diboson samples we use LO cross 
sections scaled by a factor corresponding to the ratio 
between the NLO and LO cross section prediction in 
MCFM [4l|. The boson decays are set to be inclusive. 
An 11.5% uncertainty on the MCFM cross section is 
assigned to the diboson normalization [42[ . 
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FIG. 2: Comparison of the background modeling to data 
in a QCD-dominated control region. The green area rep- 
resents the multijet background model, while the points 
represent the data. The bin at the right end of the i-axis 
represents the overflow bin. The distributions of the ob- 
servables under study show good agreement between data 
and the background model. 



VI. EVENT SELECTION 

A. Preselection and topology requirements 

We define the signal region by selecting events in 
the kinematic region where the trigger is highly effi- 
cient: $ T > 50 GeV, E% > 35 GeV and E 3 T 2 >25 GeV, 
AR(j\, j'2) > 1 and by requiring the number of jets to 
be no more than three, thus accepting events with ex- 
tra radiation from the incoming or outgoing partons, 
or hadronically-decaying taus reconstructed as jets. In 



the case of three jet events, the jet with lowest Et is 
required to have E 3 ^ > 15 GeV. Events with four or 
more jets with Et > 15 GeV and \rj\ < 2.4 are rejected 
to reduce backgrounds from QCD and tt production. 
We veto events containing well-identified electrons or 
muons as identified in Ref. [l3[ to ensure the indepen- 
dence of the analyzed samples. About 523 000 events 
pass these preselection requirements, where the s/b 
ratio for single top quark events after this selection 
is about 1/2 800 assuming SM cross sections. At this 
stage of the analysis, the vast majority of the back- 
ground events are QCD events where mismeasuremcnt 
of the jet energies gives the very large $t- These 
events are characterized by having $t aligned in the 
axymuthal angle <p with one of the jets (ji) in the 
event, where the index i runs over the jets in the event. 
We require 

A<f>(# T ,fi) > 1-5 and A(f>(E T , ja.a) > 0.4 
to reject such events. These cuts remove about an 
order of magnitude of QCD events, but still leave us 
with a s/b of ~ 1 /340, where the majority of the back- 
ground is still composed of QCD multijet production. 
Finally, we require the presence of at least one jet 
identified as originating from a b quark. We divide 
the sample in the three exclusive subsamples IS, SJ 
and 2S defined in Sec.fV] This requirement brings the 
average s/b ratio over the three subsamples to ~ 1/50, 
where the QCD background is still the dominant one. 
Thus, we need to exploit additional properties of these 
events in order to further increase the purity of the 
sample. 



B. Neural network based event selection 

We introduce here a neural network approach to 
the event selection to recognize and separate QCD 
multijet events with mismeasured jets in which $t 
is due to instrumental effects from events with $t 
originating from neutrinos. In addition, the neural 
network is designed to reject events with mis-tagged 
light flavor jets. Using a neural network instead of a 
"cut-cascade" approach to event selection allows the 
exploitation of the correlation between the many ob- 
servables which provide discrimination between signal 
and backgrounds, and gives a single output thus sim- 
plifying the determination of the optimal cut. The 
neural network model chosen is the Multi Layer Per- 
ceptron [43[ as implemented in the tmva package [3] , 
found in ROOT |45j |. 

In this analysis the charged particle spectrometer 
is used in an innovative way to discriminate between 
events containing high energy neutrinos and QCD 
events. We introduce here the imbalance in the mo- 
mentum flow in the transverse plane, and name it 

missing transverse momentum, or <pr, in analogy with 

the missing transverse energy, Et- To compute Pt, we 
select charged particles with 0.5 < pt < 200GeV/c 
and z position at the beamline compatible with the z 
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position of the primary vertex. The missing transverse 

momentum is then defined as fx = — StracksP? 1 - In 
collisions producing high energy neutrinos, the magni- 
tude of p'x {p't) is proportional to the neutrino energy, 
while p't provides a good estimate of the neutrino di- 
rection. In QCD events with the t,x plus jets topol- 
ogy, where high energy neutrinos are rarely produced 
and the fluctuation of the charged-to-neutral ratio in 
jet fragmentation is the primary source of imbalance 
of the total transverse momentum, the magnitude of 

•p'x is expected to be relatively low, and the vector fx 
to be often aligned in direction with the momentum 
of one of the most energetic jets. 

As inputs into our neural network we use the fol- 
lowing variables which describe the energy and mo- 
mentum flow in the detector: the absolute amount 
of the missing transverse energy, Ex', the absolute 
amount of the missing transverse momentum fx ', the 
Ex significance defined as Ex I \/^Er, where ^ Ex 
is a scalar sum over the energy deposited in the 
calorimeter towers; the ratio of $x to Hx; the ra- 
tio of fix = | — SjetsPrl and fex\ the invariant mass 

of Ex, ii and f 2 , M{E T ,jx,h)- 

We use the following angular variables: the az- 

imuthal difference between fx and $x, A(f>(p / x, $t)', 
the maximum of Atfi between any two jets ji,jk, 
Max(A<j)(ji, jfe)); the maximum of the difference in R 
between any two jets ji,jk, Max(AR(ji, jk))', the min- 
imum of the difference in <j> between $x and any jet 
ji, M in(A<fr($x, ji))', the minimum of the difference 

in <j> between the fx and any jet ji, Min(A(f)(fx , ji))', 
the difference in the azimuthal plane between the axis 
defined by the two most energetic jets in their rest 
frame, and the vector sum of the two jets in the lab 
frame, </>* ; the event sphericity [46| . 

We also use variables that discriminate between the 
fragmentation properties of heavy flavor quark jets 
and jets originating from light flavor quarks or gluons: 
by taking the charged particles with px > 0.5GeV/c 
and |do/ cr dol > 2.5 and contained inside the jet cone, 
we build the variable px chgd j 'fx-' for the Ex lead- 
ing and second leading jets. The 15 variables used 
as inputs to the neural network are summarized in 
TableU 

Comparisons of the kinematic distributions for 
background and signal events for the 15 variables are 
shown in Figs. [3] and[H where for simplicity the three 
subsamples IS, 2S and SJ are combined. 

The QCD background kinematics do not vary signif- 
icantly with the heavy flavor content so only one neu- 
ral network is used for the three 6-tagged subsamples. 
The single top quark signal used for the training is 
a mixture of Monte Carlo simulated s-channel events 
(50%) and i-channel events (50%), which corresponds 
to the predicted signal composition after preselection. 
For the background, we use the multijet background 
model described in Sec. El All samples are split into 



two subsamples: one for training the neural network, 
and one for making predictions of the neural network 
output and for testing for overtraining. The distri- 
butions for training and testing samples are in good 
agreement. Both samples contain 21000 signal and 
14000 multijet background events. Among the con- 
figurations investigated, the one which performs best 
uses all 15 variables defined above as inputs, two hid- 
den layers with 30 and 15 nodes respectively, and one 
output node. Figured] shows the distribution of the 
value of the output node, NNqcd- 



The signal region is defined as the sample of events 
surviving the cut on NNqcd devised to maximize the 
background rejection while retaining high signal effi- 
ciency. By requiring NNqcd >-0.1, we reduce the 
multijet contribution by 77%, while keeping 91% of 
the signal. The overall backgrounds are reduced by 
65% thus bringing the s/b ratio from 1/50 to 1/20. 
The signal significance s/y/s + b is increased by 50%, 
from 1.2 to 1.7. We look at the background-dominated 
region with NNqcd < —0.1 to verify that the back- 
ground model properly describes the data. Figure|6] 
shows distributions of data events superimposed to 
the sum of the expected backgrounds; the background 
model properly describes the data within uncertain- 
ties. Moreover, data events in the same region are 
used to compute the normalization for multijet pro- 
duction, by comparing the prediction given by weight- 
ing the events for the tag rate parametrization, and 
the observed number of data events. We find that the 
predictions must be multiplied by scale factors that 
depend on the combination of 6-tagging algorithms 
used. The values extracted are 1.08 ± 0.05 for events 
in the IS subsample, 0.79 ± 0.10 for 2S events, and 
0.76 ± 0.07 for SJ events. TablellTl shows the contri- 
bution of signal and background events in the signal 
sample, divided into three subsamples under study. 
After requiring NNqcd > —0.1, the dominant back- 
grounds are from multijet production, W/Z + heavy 
flavor jets events, and tt events. The multijet con- 
tribution after the NNqcd > —0.1 cut now primar- 
ily consists of events with true missing energy coming 
from a W or Z boson, accompanied by light-flavor jets 
misidentified as &-jets. 



We use our simulated single top quark data sam- 
ple to investigate which decay modes of the W bosons 
from top quark decays survive the event selection in- 
cluding the lepton veto. We find that the largest cate- 
gory of single top quark decays is W — > rv (~ 50% of 
decays), followed by W — > \w (~ 30%) and W — !• ev 
(~ 20%). The presence of hadronic W decays is 
suppressed by the requirement of large $x, and by 
the NNqcd > —0.1 cut, so that the fraction of all- 
hadronic single top decays is found to be negligible. 



11 




FIG. 3: Kinematic distributions for signal and background events passing the event preselection. The three subsamples 
are summed together in their respective proportions. All histograms are normalized to unit area. 
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TABLE I: Input variables to the neural network devised to suppress the multijet background. 

Variable Description 

Bt Absolute amount of the missing transverse energy 

P't Absolute amount of the missing transverse momentum 

$t/ yf^Er Missing significance 

Pt/Ht Ratio of # T to H T 

jt T I £ T Ratio of $ T to t T 

M(Et, ji, 32) Invariant mass of Et, ji and j% 

A(J)(Et,Pt) Azymuthal difference between Et and Pt 

Max(A(j)(ji, jk)) Maximum of A<j) between any two jets ji,jk 

Max(AR(ji, jk)) Maximum of AR between any two jets ji,jk 

Min(A<fi($T, ji)) Minimum of A<f> between t^r and any jet ji 

Min(A(f>(pT)ji)) Minimum of A<f> between pr and any jet ji 

4>* Aifi between the (j 1 , j 2 ) axis in their rest frame, and their vector sum in the lab frame 

Sphericity S = |(A 2 + A 3 ) [46] 

J2PT ch9d /pt^ 1 Fraction of px^ 1 carried by charged particles displaced from the primary vertex 

J2pT ch9d /pt^ 2 Fraction of px-* 2 carried by charged particles displaced from the primary vertex 
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FIG. 4: Kinematic distributions for signal and background events passing the event preselection. The three subsamples 
are summed together in their respective proportions. All histograms are normalized to unit area. 



VII. DISCRIMINATING THE SIGNAL FROM 
REMAINING BACKGROUNDS 

In the previous section we described an event selec- 
tion which enhances the signal purity of the sample by 
suppressing the presence of backgrounds that do not 
produce real neutrinos. At this stage of the analysis, 
the s/b is about 1 /20 , where the main background pro- 
cesses all produce neutrinos. Unfortunately, all of the 
surviving backgrounds have topology and kinemat- 
ics which are very similar to single top quark events. 
In addition, the systematic uncertainty on the back- 
ground prediction is approximately 2 to 4 times the 
size of the signal we seek, depending on the subsam- 
ple. 

Further discrimination of the signal from the back- 
ground is required. In order to increase the statistical 
power of the analysis, and to minimize the effect of 
the background systematic uncertainties, we study the 



signal sample to take advantage of the small residual 
differences between the signal and backgrounds. An- 
other neural network will be used for this purpose, 
where events which appear to be more signal-like are 
used to test for the presence of single top quark pro- 
duction and to measure the cross section, and events 
which appear to be more background-like are used to 
constrain the uncertain background rates. 

We use the following variables to discriminate 
between signal and background processes: the 

invariant mass of j 2 and Et, M(j 2 ,$T)', Ht', 

Min{A4>{£ T ,n)); EPT ch9d JPT n ; Y.PT chgd Ipt 32 ; 

$t; p't; <t>*; $t/H t ; M($ T ,fi,32); the invariant 
mass of all jets, M(alljets). All the above observ- 
ables are used as inputs to a multi-layer-perceptron 
neural network trained to distinguish the signal from 
backgrounds in the sample with NNqcd > —0.1. We 
use the simulated single top quark s- and i-channel 
samples in their expected proportions to build the sig- 
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FIG. 5: Distributions of the NNqcd output for events passing the event pre-selection. (a) shows the distribution for 
the signal and the backgrounds normalized to unity. The remaining three plots show the same distribution for the three 
subsamples, IS (b) 2S (c) and SJ (d), where the background and signal predictions are stacked according to predictions, 
and compared to data events. As can be seen from the plots, the kinematics of the QCD background events are very 
different from the signal and the other backgrounds. These events are removed using the cut NNqcd < —0.1. The 
remaining events are used to scan for the presence of the single top quark signal. 



nal sample (approximately 50%-50%). For training 
purposes, we select the background processes which 
account for more than 5% of the total background: 
multijet, W — > tv plus heavy flavor jets, Z — > vv 
plus heavy flavor jets, and tt production. Both train- 
ing and test samples contain 39 000 signal and 42 000 
background events. The network architecture consists 
of an input layer with 11 nodes corresponding to the 
input variables shown in Table lVTll plus one bias node; 
one hidden layer with 22 nodes and one hidden layer 



with 11 nodes, and an output layer with one output 
node, which we label NN sig . We compare the NN sig 
output distribution between the training and test- 
ing samples and find good agreement. The distribu- 
tions of the input variables for events in the signal re- 
gion, for all b— tagged subsamples, are shown in Fig. [71 
where the shapes of the distributions for each group of 
physics processes are compared. Figure|S] shows that 
the predictions agree well with the observed data. The 
output of NN S ig is shown in Fig. [HI where the con- 
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FIG. 6: Comparison of the QCD background modeling to data in the NNqcd < —0.1 control region. The three 
subsamples are summed together in their respective proportions. Good agreement is visible between data and the 
background model. 



tributions from the signal and main backgrounds are 
normalized to unit area. The single top quark signal 
events populate mostly the region of NN sig around 0.3, 
while background events populate mostly the region 
with NN S ig around -0.3. Finally, the same distribution 
is shown in Fig. 1101 where the signal and backgrounds 
contributions have been normalized according to their 
estimates in the three subsamples, and the data are 
superimposed. 



VIII. SYSTEMATIC UNCERTAINTIES 

Systematic uncertainties are grouped by their 
sources, where a given source of uncertainty may af- 



fect several background and signal distributions. The 
various systematic uncertainties originating from the 
same source are considered 100% correlated. There 
are two categories of systematic uncertainties. Rate 
uncertainties arc related to the predicted production 
rates, efficiency and acceptance of the various signal 
and background processes. Shape uncertainties ex- 
press differences in the distributions due to a given 
systematic source. Some sources of systematic uncer- 
tainty affect both rates and shapes. All rate uncer- 
tainties are assigned a truncated Gaussian prior, pre- 
venting negative predictions. For shape uncertainties, 
if the prediction for a given bin is negative, it is set to 
zero. 
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FIG. 7: Kinematic distributions for the signal and background events in the signal region (NNqcd > —0.1). The three 
subsamples are summed together in their respective proportions. All histograms are normalized to unit area. 
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FIG. 8: Kinematic distributions for the signal and background events in the signal region (NNqcd > —0.1). The three 
subsamples are summed together in their respective proportions. All physics processes contributions are normalized to 
the expected amount of events, as described in Sec.lVl 
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TABLE II: Number of predicted and observed events in 
the signal region defined by requiring NNqcd > —0.1 in 
the subsample with exactly one SECVTX-tagged jet (IS), 
two SECVTX-tagged jets (2S) and one secvtx- and one 
jETPROB-tagged jet (SJ). The notation "h.f." stands for 
heavy flavor jets. The uncertainty in the predicted number 
of events is due to the theoretical cross section uncertainty 
and to the uncertainty on signal and background modeling. 

Process lb-tag (IS) 2 fr-tags(2S) 2 b-tags(SJ) 



s- channel 


15.7±2.0 


7.6±0.9 


6.3±0.8 


t-channel 


31.2±4.9 


1.7±0.2 


1.6±0.2 


tt 


125±23 


30.3±5.8 


29.2±5.7 


ww/wz/zz 


33.0±6.5 


4.9±0.6 


4.2±0.6 


W + h.f. 


269±113 


12.7±7.5 


22.7±13.7 


Z + h.f 


105±53 


11.8±5.8 


11.8±6.0 


multijet 


592±27 


28.9±3.8 


58.5±5.8 


Total 


1172±169 


98±15 


134±21 


Observed 


1167 


113 


131 



B. Integrated luminosity 

This systematic source accounts for the uncertainty 
in the pp inelastic cross section and for the uncertainty 
in the acceptance of the luminosity monitor of CDF to 
inelastic pp collision events [47| and it is estimated as 
6%. This uncertainty is applied to the rate predictions 
based on Monte Carlo simulation, i.e. all processes 
apart from multijet production. 

C. Trigger efficiency 

Since we are using data below the fully efficient 
region of the trigger, we apply a parametrization of 
the trigger efficiency to the Monte Carlo simulated 
backgrounds. We assign a systematic uncertainty to 
both Monte Carlo simulated backgrounds and signal 
acceptances by varying the trigger efficiency parame- 
ters [28[ . 
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D. b-tagging efficiency 

The ^-tagging efficiency affects the predicted rates 
of signal and backgrounds estimates for which we use 
Monte Carlo simulations. Known differences between 
the data and the simulation are corrected by scaling 
the simulation, and uncertainties on these scale fac- 
tors are collected together in one source of uncertainty 
(they affect the predictions in the same way). We as- 
sign an uncertainty of 4.3% for the IS prediction, 8.6% 
for the 2S and and 12% for SJ. 



E. Lepton veto 

The uncertainty in the efficiency of the cuts used to 
veto leptons was determined to be 2%. 



FIG. 9: Distribution of NN a i s for the signal and main 
background processes in the signal region. The three sub- 
samples are summed together in their respective propor- 
tions. All histograms are normalized to unity. A small 
but important residual discrimination of the signal from 
the backgrounds is obtained. 



A. Theoretical cross sections 

For all physics processes modeled by Monte Carlo 
simulation, we normalize to the most up-to-date the- 
oretical computation of the cross section, and corre- 
sponding uncertainties. We use 12% uncertainty for 
top quark pair production [l[ , 40% uncertainty for the 
W and Z background processes [4(J . and 11% for the 
diboson prediction |42j . 



F. Initial and final state radiation (ISR/FSR) 

To evaluate the ISR, pythia uses a model of "back- 
wards evolution" [35|, |48[ . The model used by pythia 
for gluon radiation from partons emitted from the 
hard-scattering interaction has been tuned with LEP 
data. Monte Carlo simulated samples are generated 
for single top quark signals and ti with Aqcd dou- 
bled (more ISR) or divided in half (less ISR) and with 
the initial transverse momentum scale and the hard 
scattering of the shower both multiplied (more ISR) 
or divided (less ISR) by four. The parameters for the 
final-state showering are also adjusted in pythia, ex- 
cept for the hard-scattering scale. The uncertainties 
are then computed by comparing the efficiencies and 
kinematics of the varied ISR/FSR events to the nom- 
inal ones. The effects of variations in ISR and FSR 
are treated as 100% correlated with each other. 
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TABLE III: Input variables to the neural network aimed at discriminating single top quark production from the back- 
grounds remaining after the NNqcd > —0.1 requirement. 



Variable 



Description 



.n 



M(Et,J2) Invariant mass of &t and j2 

Ht Scalar sum of the jet energies 

Min(A<f)(#T,ji)) Minimum of Acj) between $t and any jet j. 

Fraction of pt 31 carried by charged particles displaced from the primary vertex 
Fraction of pt 32 carried by charged particles displaced from the primary vertex 
Missing transverse energy 
Missing transverse momentum 

A(j) between the (j 1 ,,? 2 ) axis in their rest frame, and their vector sum in the lab frame 
Ratio between &t and Ht 

Invariant mass of $t , ji and fa 
Invariant mass of all jets in the event 



J2pT° h9d /pT 
Y.VT Ch9d IPT n 
$T 

ir 



$t/Ht 

M{£ T ,flj2) 

M(alljets) 



G. Jet energy scale (JES) 



Multijet model 



Each step in the correction of the calorimeter re- 
sponse to particle jets involves an uncertainty, which 
is propagated to the final JESp9}. The effect of JES 
uncertainties are estimated by varying the jet energy 
scale in all Monte Carlo simulated samples twice: one 
upwards, one downwards. They are evaluated for each 
background and signal contribution, and both rate 
and shape uncertainties are taken into account. 



H. Parton distribution functions (PDF) 

Lack of precise knowledge of the PDFs is a source of 
theoretical uncertainty for the amount of signal pro- 
duced. The uncertainty is estimated using different 
sets of PDF eigenvectors. The default PDF set used 
in this analysis is the CTEQ5L set [34[ . The uncertainty 
is determined comparing: 

• two different LO PDF parametrization sets, 
CTEQ5L [13 and MRST72 gg|; 

• MRST72 and MRST75 with different A QC d; 

• the variation within their uncertainties of each 
of the 20 signed eigenvectors of the NLO PDF 
set CTEQ6M |c| with the default PDF set. 

The total PDF uncertainty is obtained adding the 
larger of the 20 eigenvectors' uncertainty (all added 
in quadrature) or the MRST72 and CTEQ5L PDF 
sets uncertainty in quadrature with the Aqcd uncer- 
tainty. A 2% uncertainty was found to be sufficient 
for all the backgrounds. The PDF uncertainty on the 
signal acceptance range from 1% to 2% depending to 
the subsample. Shape variations induced by PDF sys- 
tematic changes are considered only for the single top 
quark process. 



The data-driven model for multijet production pre- 
dicts the shapes of the distributions. The rates are 
obtained from the NNqcd < —0.1 control region. In 
this region, we assign a scale factor associated with 
the difference between the data rates and the Monte 
Carlo simulation prediction, which we then multiply 
to the multijet predicted rate. We then obtain the un- 
certainty on this scaling using propagation of errors. 
Depending on the subsample under study, we assign 
an uncertainty between 4.5% and 13%. The varia- 
tions in the tag rate probability parametrization used 
to estimate the multijet background also modify the 
shapes of the distributions. The shape uncertainty is 
obtained by varying the tag rate probability by the 
uncertainty in its estimation. We also take into ac- 
count the normalization uncertainty on the processes 
which are part of the background in the region from 
which we get the QCD normalization. Those uncer- 
tainties are anti-correlated with respect to the nor- 
malization on these processes, and are weighted ap- 
propriately. In the NNqcd < —0.1 control sample 
used to derive the normalization, the biggest contam- 
ination sources come from W + heavy flavor jets (5% 
of the IS and < 1.5% of the 2S and SJ samples), tt 
(2.7% of the 2S sample and < 1.5% of the two other 
samples), and Z + heavy flavor jets (< 1.8% of the 
three b— tagged subsamples). Diboson contamination 
is negligible (< 0.5%). To avoid double counting non- 
multijet events in our estimation of the multijet back- 
ground, we apply the tag rate parametrization to our 
Monte Carlo simulation predictions and subtract the 
output from the data. The single top quark signal 
as predicted by Monte Carlo simulation is also sub- 
tracted from the data. We associate a shape system- 
atic uncertainty to this removal by varying the amount 
of single top quark we subtract by 50%, more than 
three times the theoretical uncertainty on the single 
top quark cross section. 



19 



a 

> 





(a) 



(b) 




NN . 

sig 




(c) 



DATA 
Single Top 
Multijet 
Diboson 
W/Z + h.f. 
Top Pair 



FIG. 10: NN s ig discriminant output distributions in signal region, for the IS subsample (a) , 2S(b), SJ(c). 



J. Background scaling 



K. Top quark mass dependence 



A small fraction of the data events analyzed in 
this paper pass the event selection requiring identified 
charged leptons in the final state 13] . The fraction has 
been computed using single top quark Monte Carlo 
simulation to be 2%. To maintain a 100% orthogo- 
nality with Ref. [Tj3], these events are discarded from 
this analysis. We scale down the predicted amount 
of background events by 2%, and assign an additional 
uncertainty of 2% to the background yields. 



The most precise measurement of the top quark 
mass corresponds to M top = 173.1 ± 1.3GeV/c 2 [5lj ). 
We consider for this analysis a nominal top quark 
mass of M top = 175 GeV/c 2 for the acceptance com- 
putation and kinematic estimation, and use the two 
extreme values of 170 and 180 GeV/c 2 to compute the 
systematic shifts. This uncertainty is considered for 
all processes producing top quarks when extracting 
the value of V t b and computing the significance of the 
measurement. 



The summary of the systematic sources, their effect 
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on the rates of different processes and how they affect 
the kinematics can be found in Table IIVI 



IX. CROSS SECTION AND SIGNIFICANCE 
EXTRACTION 

We scan the NN s ; g distribution using a binned like- 
lihood technique to measure its cross section, as well 
as to determine the significance of the excess itself. 
The likelihood function L is given by the product of 
the likelihood for each of the different subsamples L c , 
L = n_=i ^ c wnere N c = 3 are the three subsamples 
subdivided according to the number of 6— tagged jets 
and the tagging algorithm used (IS, 2S, SJ). The like- 
lihood L c for each subsample to observe the data in 
the final NN distribution is defined as: 

^bins ^bins m — ■ 

l c = JJp(n,| Mi )= II ^fr— • W 

%—\ l—l 

where n, is the data count in that particular bin and 
^bins is the number of bins in the distribution which 
is scanned to look for an excess of signal-like events. 
The prediction in each bin is a sum over signal and 
background contributions: 

"■bkg 

k=l 

where bik is the background prediction in bin i for 
background source k given the number of background 
sources ribkg and Sj is the signal prediction in bin i 
for the s— and t— channel single top quark production 
summed according to the standard model proportions. 
Uncertain nuisance parameters 6 affect the signal and 
background predictions and kinematics. The induced 
effect on the event rates and shapes of the kinematic 
distributions can be correlated with each other. The 
likelihood L is then a function of the observed data 
D, the signal cross section cr s +t and of the nuisance 
parameters 6 which affect the signal and background 
predictions, L(D\a s+t ,d). We use Bayes' theorem to 
convert the likelihood into a posterior density function 
in a s +t ■ We use the posterior density function to quote 
the measured value of the production rate [52j ■ The 
posterior density function is: 

p(a a+t \D) = ^ j L(D\a s+t ,e)7T(9,a s+t )de (3) 

where N is an overall normalization obtained from 
the requirement f p(a s +t \D)da = 1. The function 
%{6,a s +t) is the prior probability density, which en- 
codes our knowledge of the parameters 6 and cr s +t- 
Since our knowledge of the cross section is indepen- 
dent of our prior knowledge of the nuisance parame- 
ters, we can write the prior density as ir(0,<7 s+t ) = 
tt(0) ■ 7r(<7 s +t). The ignorance about the true single 



top quark production cross section is encoded through 
the prior probability density function, which is set 
equal to the Heaviside function in tx s +t, tt(ct s +t) = 1 if 
Cs+t > and Tr(a s +t) = otherwise. The prior prob- 
ability density function on the nuisance parameters 
are set to be Gaussian distributions, characterized by 
the estimated central value of the systematic source 
and the associated uncertainty. The posterior density 
function can thus be written as 

p(a s+t ,D) = ± J L(D\v a+t ,O)n(fi)ic{<T B+t )d0 (4) 

The marginalization of the posterior density function 
is done using Monte Carlo integration, by generating 
a large number of points in the nuisance parameters 
space, according to their priors probability density 
functions. 

In doing the generation, we take into account the 
correlations between nuisance parameters. Shape and 
rate uncertainties due to a given nuisance parameter 
are treated as 100% correlated. We define the mea- 
sured cross section cr^f s as the value corresponding 
to the mode of the p(<7 s+t \D) distribution, and its 
uncertainty as the smallest interval enclosing 68% of 
the integral. In order to measure the single top quark 
production cross section and its uncertainty, we do not 
include the m± uncertainty in the ti background or in 
the signal, but rather quote the measurement at the 
assumed top quark pole mass of m t = 175GeV/c 2 . 
We assume the standard model ratio between s- and 
t-channel production. 

The measured cross section depends on the true 
cross section but also the random outcome of the data. 
The sensitivity of the analysis is evaluated using the 
expected distribution of outcomes, assuming a signal 
is present. The event selection cut values and the final 
discriminant used to scan for the signal have been cho- 
sen to maximize the expected sensitivity to the signal. 
We check our cross section fit method using pseudo- 
experiments generated varying the input signal cross 
section and systematic uncertainties, which are then 
fit to measure the signal cross section to check for 
possible biases. The procedure used cannot produce 
a negative cross section measurement, since the priors 
are zero for negative values. For an input cross sec- 
tion of zero, half of the measured cross sections then 
are exactly zero, and the other half form a distribu- 
tion of positive fit cross sections. We therefore use 
the median fit cross section for our linearity check to 
avoid the bias which would be introduced by using 
the average instead. Distributions of 68% and 95% of 
extracted cross sections centered on the medians are 
then shown as a function of the input cross section in 
Fig.HTI It can be deduced from the same plot that 
the fit technique used here does not introduce a bias. 
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TABLE IV: Summary of systematic uncertainties and their treatment in the analysis. A range of values is listed for a 
systematic source to indicate that the effect is different in the three subsamples defined by the different heavy flavor jet 
content. The "X" sign in the "Shape column" means that the influence of the change in the systematic source on the 
kinematic distributions of the physics processes has been considered, while the "-" sign means that it is not applicable, or 
not considered. The "comment" column describes whenever a systematic is considered only for some physics processes, 
or treated differently for some physics processes, or considered only in certain computations. 

Systematic source Rate Shape Comment 

Top quark pair production cross section ± 12% 
W / Z + heavy flavor jets cross section ± 40% 



Diboson cross section 


± 11% 






Luminosity 


6% 




Not for multijet 


Trigger efficiency 


< 2.6% 


X 




b— tagging efficiency 


4.3% to 12% 






Lepton veto 


2% 






ISR/FSR 


-4.5% . . . +16% 


X 


Only for top quark processes 


JES 


-14% . . . +23% 


X 




PDF 


±1%...±2% 


X 


Shape for signal only 


Multijet model 


4.5% . . . 13% 


X 





Background scaling 2% 

Single top quark cross section ± 12% „ , „ . . T . 

rp i i j 1 co/ ,-7 co/ v Only for p-value and 14b computati 

Top quark mass dependence -16% . . . +7.5% X 
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FIG. 11: Distributions of 68% and 95% of extracted cross 
sections centered on the medians are shown as a func- 
tion of a set of input cross section. The line represents 
^mcas _ 0.2» ( _ The plot shows that the fit technique does 
not introduce bias. 



A. Significance calculation 

In addition to measuring its cross section, it is also 
important to estimate the significance of the measure- 
ment itself. To do so, we use the p- value, which is 
the probability of observing an outcome of our exper- 
iment at least as signal-like as the one observed or 
more, assuming that a signal is absent. By conven- 



tion, an observed p- value of less than 1.35 x 10~ 3 con- 
stitutes evidence for a signal, and an observed p- value 
less than 2.87 x 10 -7 constitutes a discovery. These are 
the one-sided integrals of the tails of a unit Gaussian 
distribution beyond +3ct and +5ct, respectively. The 
experimental outcome is ranked on a one-dimentional 
scale using the likelihood ratio test statistic [53| : 



21nQ 



-2 In 



L(D\a 



s+t 



' s+t/ 



L(D\a s+t = 0) 



(5) 



In the computation of the — 21nQ and thus the p- 
value, we include all sources of systematic uncertain- 
ties, including the theoretical uncertainty on the sin- 
gle top quark production cross section and on the top 
quark mass itself. We perform two sets of a large num- 
ber of pseudoexperiments and compute corresponding 
— 21nQ distributions. In the first one, pseudodata are 
generated in the hypothesis that single top quark pro- 
duction is present in the SM-predicted amount (S+B). 
In the second, pseudodata are generated according to 
the background-only hypothesis (B). The p- value is 
the probability that — 21nQ < — 21n<5 mcas in the B 
hypothesis. To compute the expected p— value, we set 
Qmcas as the mode of the Q distribution assuming that 
a' is equal to the theoretical prediction for the SM sin- 
gle top quark production cross section. To compare 
the data with the SM predictions, we then set a' to be 
equal to of+f 5 and compute the corresponding Q 

mo as 

and then the observed p-value. 
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B. Constraining Vtb 

We can also use the knowledge of the standard 
model prediction for the single top quark produc- 
tion cross section to compute the Vtb element of 
the Cabibbo-Kobayashi-Maskawa matrix. Under the 
standard model hypothesis, with the assumption that 
\V td \ 2 + \V ts \ 2 < \V tb \ 2 and that only |F t6 | incorpo- 
rates new physics contributions, one can measure Vtb 
using the relation \Vtb\ 2 = "'^u ■ The theoretical un- 

CT s + t 

certainty on (T~¥j. is taken into account when setting 
the Vtb constraints, together with the uncertainty on 
the top quark pole mass measurement. 



Interpreting the 2.1 a excess as originating from sin- 
gle top quark production, we measure a single top 
quark production cross section of 

< + T - 4.9t|* pb. 

The value measured in the data is compared to the 
measurement outcomes from 150 000 pseudoexperi- 
ments, as shown in Fig. 1131 The single top quark 
production cross section measurement presented here 
is consistent with a +1 a statistical upward fluctua- 
tion with respect to the standard model cross section. 
The probability to measure a cross section higher than 
4.9 pb has been estimated to be 18%. 



X. RESULTS 

Using the signal and background modeling de- 
scribed in Sec. El scanning the multivariate discrim- 
inant described in Sec. lVIII and using the statistical 
test described above, we compute the probablility that 
the background (B) looks at least as signal-like as the 
data (observed p-value) or as the median of signal 
plus background (S+B) pseudo-experiments outcomes 
(expected p-value). Once including all systematic 
sources, we obtain an expected p-value of 7.9 x 10~ 2 
(1.4cr) and an observed p-value of 1.6 x 10~ 2 (2.1 a). 
The distributions of —2 In Q for the B or S+B hypoth- 
esis are shown in Fig. [12] together with the observed 
outcome in the data. 
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FIG. 12: Distribution of -21nQ for the B or S+B hy- 
pothesis. The dark vertical arrow shows the mode of the 
— 21nQ distribution in the S+B hypothesis using pseu- 
doexperiments, while the bright vertical arrow shows the 
outcome in data. The expected (observed) p-value is the 
integral of the — 21nQ distribution in the B hypothesis at 
the left of the dark (bright) arrow. 
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FIG. 13: Distribution of cross section measurement out- 
comes using pseudoexperiments. The green arrow shows 
the cross section measured in the data. The probability to 
measure a cross section higher than the median expected 
cross section shown as a dotted line is 18%. 

As a cross-check, we perform the measurement sep- 
arately in the three subsamples. The results are shown 
in Fig.[T4land indicate that, while the precision is low, 
the three orthogonal measurements are in agreement 
with each other. 

Finally, we measure the Vtb element of the CKM 
matrix. Using an unconstrained flat prior on \Vt 



we find \V tb \ = 1.24±g;|| ± 0.07(theory) as shown in 



ib\ 



Fig. 15(a) 



The theoretical uncertainty is due to the 
uncertainty on the standard model theoretical cross 
section for single top quark production [(| Q . Assum- 
ing a flat prior on \Vtb\ 2 between and 1, Fig. 15(b) 
shows that V t b > 0.36 at 95% credibility level. 



XI. SUMMARY 

We have presented the first search for s- and t- 
channel electroweak single top quark production in 
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FIG. 14: Measurement of the single top quark cross section 
production. We show the combined result in the whole 
dataset analyzed, and the result obtained in each tagging 
category. All measurements are consistent with the stan- 
dard model theoretical cross section within uncertainties. 



the ^T+jets signature. This dataset is orthogonal to 
the one used to achieve the evidence level at CDF [ll| 
and DO 

and it is sensitive to the W rv de- 
cays. Using an optimized neural network-based kine- 
matic selection and &-jet identication techniques, we 
are able to improve the s/b of the initial sample ob- 
tained with a i^T+jets trigger from about 1/10 000 
to about 1/20. We have analyzed 2.1 fb -1 of inte- 
grated luminosity recorded with the CDF II detector 
and observed an excess of signal-like events with re- 
spect to the standard model background prediction. 
The probability that the background-only hypothesis 
would produce the observed data is 1.6 x 10~ 2 (2.1 a). 
Assuming that the excess originates from single top 
quark production through s— and t— channel and a 
top quark mass of 175 GeV/c 2 we measure 



's+t 



4.9t 2 J pb. 



We use the theoretical computation of the signal cross 
section to measure the Vtb element of the CKM ma- 
trix: 



Wt 



tb 



l-24±g:|| ± 0.07(theory). 



Assuming < Vtb < 1> we set the lower limit of 
V t b > 0.36 at 95% confidence level. This analysis has 
been combined with the search performed by the CDF 
collaboration in events with an identified charged lep- 
ton plus $t plus jets signature [l3| . The combination 
of the two searches observes an excess of signal-like 
events over the background expectations at the 5cr 
level, thus estabilishing the existence of this rare pro- 
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FIG. 15: Posterior probability density of the square of 
the Vtb element of the CKM matrix, assuming an uncon- 
strained fiat prior on (a) and assuming a flat prior on 
Vt b constrained between zero and one (b). 



cess. Finally, the combination of the measurement 
presented here with the measurements in the charged 
lepton plus fir plus jets signatures by CDF and DO 
has been performed to obtain the most precise direct 
measurement of Vtb to date [5J] . 
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